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Abstract 

We show lower bounds in the multi -party quantum communication complexity model. In this model, 
there are t parties where the ith party has input Xi C [n]. These parties communicate with each other by 
transmitting qubits to determine with high probability the value of some function F of their combined 
input (X\ , X2 , . . . , Xt ). We consider the class of functions whose value depends only on the intersection 
of Xi,X2, . . . , X t ; that is, for each F in this class there is an fp : 2^ — > {0, 1}, such that 

F{x u x 2 , ...,x t ) = f F {Xx nx 2 n...n x t ). 

We show that the i-party fc-round communication complexity of F is n(s m (fp)/(k 2 )), where 
s m (fp) stands for the 'monotone sensitivity of fp' and is defined by 

s m (f F ) ± max \{i : f F (SU{i}) + f F (S)}\. 

SC[n] 

For two-party quantum communication protocols for the set disjointness problem, this implies that 
the two parties must exchange fl(n/k 2 ) qubits. An upper bound of 0(n/k) can be derived from the 
O(^fri) upper bound due to Aaronson and Ambainis (see also |BCW98| and |HdW02|). For k — 1, 
our lower bound matches the f2(n) lower bound observed by Buhrman and de Wolf |BdW01 1 (based 
on a result of Nayak |Nay99|), and for 2 < k <C n 1 / 4 , improves the lower bound of fl(^/ri) shown by 
Razborov [Raz02|. (For protocols with no restrictions on the number of rounds, we can conclude that 
the two parties must exchange f^n 1 / 3 ) qubits. This, however, falls short of the optimal Q(i/n) lower 
bound shown by Razborov |Raz02|.) 

Our result is obtained by adapting to the quantum setting the elegant information-theoretic arguments 
of Bar-Yossef, Jayram, Kumar and Sivakumar |BJKS02|. Using this method we can show similar lower 
bounds for the Coo function considered in | B JKS02 1 . 

1 Introduction 

Classical communication complexity: The communication complexity model of Yao [ Yao79 1 provides 
an abstract setting for studying the communication required for computing a function whose inputs are 
distributed between several parties. In its most widely studied version, there are two parties, Alice and Bob 
with inputs Xa,Xb Q [n], who exchange messages based on a fixed protocol in order to determine the 
value of some function F(Xa, Xb)- The goal is to design a protocol so that the parties need to exchange as 
few bits as possible. This model of communication is relatively well-understood (see the book of Kushilevitz 
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and Nisan [NK97 1) both in the deterministic and the randomized setting. In this paper, we will be interested 
in the randomized setting, where the parties are allowed to err with some small probability (say at most 
|). Tight lower bounds are known for several functions, in this model, for example, the equality function 

X A = X B IIYao79MLS8"Tl . the set disjointness function X A n X B = IKS92"l lRaz9"2l and the inner-product 
function \X A n X B \ (mod 2) ICG88I . 

Quantum communication complexity: The two-party quantum communication model (see Section l2~Tt 
was introduced by Yao [Yao93], in order to investigate if communication costs for computing functions 
distributively reduces significantly when the parties are allowed to exchange qubits and perform quantum 
operations locally. Since then, there has been a flurry of results in this model. We will be mainly inter- 
ested in the bounded error version of this model, where the two parties are allowed to err with some small 
probability (say at most |). It was observed early that for the equality and the inner-product functions the 
quantum model does not provide any significant savings: the complexity of the equality function is still 
6 (log n) [Kre95 1 and the complexity of the inner-product function is still 0(re) [Kre95 CvDNT98 ]. 

The set disjointness function: For the set disjointness function, however, quantum protocols were found 
to be strictly more powerful than their classical randomized counterparts. Since the communication com- 
plexity of the set disjointness function is central to the work presented in this paper, we describe its history 
in greater detail. In the bounded error classical setting Babai, Frankl and Simon [BFS86| showed a lower 
bound of Q(y/n). This was improved to an Q(n) lower bound by Kalyanasundaram and Schnitger [KS92]; 
their proof was simplified by Razborov [Raz92|. There is a straightforward protocol with n + 1 bits of 
communication where Alice sends her entire input to Bob, who computes the answer and returns it to Al- 
ice. Interest in the communication complexity of several problems related to the set disjointness function 
has been revived recently because of their connection to showing lower bounds in the classical datastream 
model IAMS991 IFKS02I lGGI+021 UndOOl IGMMO00I IJKS031 ISS021 . One of these problem is the 
promise problem: Alice and Bob are given inputs X A , X B £ {0, 1, ... , m} n , with the promise that either 
for all i G [n], |^aH — ^bWI < 1 or there exists an i G [n], such that \X[i] — Y[i]\ = m; they must 
communicate in order to distinguish between these two types of inputs. For this problem, Saks and Sun 
[SS02] showed a lower bound of f2(n/m 2 ) in a restricted model; their lower bound was strengthened by 
Bar-Yossef, Jayram, Kumar and Sivakumar [BJKS02|, who obtained the same lower bound without any 
restrictions. 

In the quantum setting, the set disjointness function was first addressed by Buhrman, Cleve and Wigder- 
son [BCW98|, who showed that there is a protocol for this problem with 0{y/n\ogn) bits of communica- 
tion. This bound was improved to 0(y/nc Xog n ), where c is a small constant, by Hoyer and de Wolf IHd W02L 
and recently to 0(y/n) by Aaronson and Ambainis IAA031 . By a result of Razborov [Raz02 1 this last bound 
is optimal. 

Multi-party classical communication complexity: There are several ways to generalize the two-party 
model to the multi-party model. In this paper, we will consider the version where there are t parties 
Pi , P2 , . . . , Pt with respective inputs X\ , X2 , . . . , Xt C [n\. In each round of communication some party 
sends a message to another party. The party who receives the last message can determine the desired value 
F(X\,X2, ... ,Xt) based on his current state at that point. Recently, because of its connection to the 
problem of computing frequency moments in the data stream model [AMS99|, the following promise set 
disjointness problem has been studied. Here, the parties are required to distinguish between two extreme 
types of inputs: in the first type, X\, X2, ■ ■ ■ , X t are pairwise disjoint; in the second type, X\, X2, ■ ■ ■ , Xt 
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have exactly one element in common but are otherwise disjoint. For this problem, Chakrabarti, Khot 
and Sun [CKS03| show a lower bound of U(n/(tlogt)), improving an earlier Q(n/t 2 ) lower bounds of 
Bar-Yossef, Jayram, Kumar and Sivakumar [BJKS02| and an Q,(n/t A ) lower bound of Alon, Matias and 
Szegedy [AMS99|. A slight variant of this problem, called the approximate set disjointness problem, was 
considered by Nisan [Nis02|; the lower bounds mentioned above apply to Nisan's version as well. The 
multi-party quantum communication complexity of these problems has not been considered before this 
work. 

1.1 Our results 

The upper and lower bounds on the two-party quantum communication complexity of the set disjointness 
function are tight up to constant factors, if there are no restrictions imposed on the number of rounds (i.e. 
the number of messages) in the protocol. The best upper bound uses 0(y/n) rounds of communication, and 
from it one can derive a k -round protocol where the parties exchange a total of at most 0(n/k) qubits. For 
k = 1, Buhrman and de Wolf [BdWOl | observed that the lower bound of Q(n) follows from the results 
of Nayak [ N ay99| for the index-function problem. For k > 2, Klauck, Nayak, Ta-Shma and Zucker- 
man IKN TZOll showed a lower bound of n l / k , but this is subsumed by Razborov's [Raz02| lower bound 
of £l{y/n) which holds even if there is no restriction on the number of rounds. However, for small k, 
Razborov's lower bound is far from the best upper bound known, namely 0[n/k). Our first result, gives 
lower bounds for the two-party /e-round communication complexity that comes closer to the upper bound. 

Result 1 The two-party k-round quantum communication complexity of the set disjointness function is 
n(n/k 2 ). 

In fact, this lower bound holds even if the protocol is only required to distinguish between disjoint sets 
and sets with exactly one element in common. Using easy reductions one can conclude that a similar lower 
bound holds for several other functions. A function F is said to be set disjointness-like if its value depends 
only on the intersection of Xa,Xb', that is, there is an fp : 2^ — > {0,1}, such that F(Xa, Xb) = 
Jf(Xa n Xb)- We obtain a non-trivial lower on the communication complexity of such functions F, if the 
underlying function fp has high monotone sensitivity: s m (fp) = max \ {i : fp(SU {i}) / fp(S)}\. 

SC[n] 

Result 1': The two-party /c -round quantum communication complexity of the a set disjointness-like function 

f is n( Sm (f F )/(k 2 )). 

For the promise problem we get the following. 

Result 2 The two-party k-round quantum communication complexity of the promise problem is 

0(n/(fc 3 m( fe+1 ))). 

We define a model for multi-party quantum communication complexity and show the following 1 . 

Result 3 The t-party k-round quantum communication complexity of the promise set disjointness problem 
is Q(n/k 2 ). [This lower bound also holds for Nisan's approximate set disjointness problem.] 

All our lower bounds hold even if the parties start with arbitrary prior entanglement that is independent 
of the inputs. 

'Our lower bound appears to contradict the 0(n/t) upper bound of |BJKS02|. This is because that upper bound is in the 
simultaneous message model, whereas in our definition of quantum protocols one is required to pass fixed length messages from 
one party to another. 
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1.2 Techniques used 



The original lower bounds for the set disjointness problem in the classical setting are based on deep analyses 
of the communication matrix and can be said to be based on the discrepancy method [ChaOO]. Razborov's 
recent £l(y/n) lower bound for quantum protocols also uses the discrepancy method. The discrepancy 
method for quantum protocols was formulated explicitly by Kremer [Kre95| (see also Klauck [KlaOl | and 
Yao [Yao93|), but Razborov's proof extends it substantially by developing interesting and powerful tools 
based on the spectral theory of matrices. 

Recently, however, Bar-Yossef et al. [BJKS02| proposed an information-theoretic approach for studying 
set disjointness-like problems in the classical setting. Using a refinement of the notion of information of 
communication protocol originally defined by Chakrabarti, Shi, Wirth and Yao ICS WCCY01I . they showed 
that a linear lower bound for the set disjointness problem follows from £1(1) lower bound on a certain in- 
formation cost of a two-party communication protocol computing the AND of just two bits! Their work 
provided a compelling and beautiful illustration of information-theoretic tools in the analysis of communi- 
cation protocols. 

We adapt their approach to the quantum setting. In order to bring out the contribution of this paper 
more clearly, we will now informally describe the information-theoretic argument underlying their proof 
and discuss how we adapt them to the quantum setting. The argument has two parts: in the first part, using 
a direct-sum argument for information from Bar-Yossef et al. [BJKS02], one reduces the set disjointness 
problem to a communication problem associated with the AND of two bits (one with Alice and one with 
Bob); in the second part, one shows that this problem on two bits is hard. 

The information cost approach: The first part of the argument is based on the notion of information 
cost of communication protocols, defined (by [CSWCCY01|) to be the mutual information between the 
inputs (which are assumed to come from some distribution) and the transcript of the protocol. Bar-Yossef et 
al. [BJKS02| examine the information cost of the protocol for several distributions. Let the number of bits 
transmitted by the protocol be c. Then, the information cost is also bounded by c for each distribution. 

At this point it will be convenient to view the inputs of Alice and Bob as elements of {0, 1} and the 
set disjointness function as VILi ^A[i] A A typical distribution considered by Bar-Yossef et al. is 

defined as follows. For each i, independently, one party is given the input and the other party is given a 
random bit. Using the sub-additivity property of mutual information, one concludes that the sum over i of 
the mutual information between the transcript and the input Xa [i] is bounded by c; a similar statement holds 
for Bob's inputs. It is then not hard to argue using a standard averaging argument that there is an % and a 
product distribution D*, for inputs (Xa[j], -^b[j] : J / i) such that the following conditions hold: 

• For all j / i, X A [j] A X B [j] = (with probability 1). 

• If Xa[i\ is set to zero and Xg[i] is chosen at random (and the remaining bits are chosen according 
to the product distribution D*), then the mutual information between the transcript and Xb[i\ is at 
most 2c/ n; similarly, if Xs[i] is set to and Xa[i\ is chosen at random (an the remaining bits are 
chosen according to the product distribution D*), then the mutual information between the transcript 
and Xa[t] is at most 2c/ n. 

From the first condition, by viewing (Xa[j], Xb[j] : j / i) as private random bits of the two-parties, we 
obtain from the protocol for set disjointness a protocol that computes the AND of the two bits Xa[i\ and 
Xb[z]. The stage is thus set for analysing the information cost of computing the AND function: a lower 
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bound of e on this quantity translates to a lower bound of Q(en) on the communication complexity of the 
set disjointness function. 

In order to implement this programme in the quantum setting, one has to define a notion of information 
cost for quantum protocols. It is not immediately clear how this can be done, because quantum operations 
are notorious for destroying the states on which they act; in particular, it is not reasonable to expect that the 
complete transcript of all messages is part of the final global state of the algorithm. Even if the complete 
transcript is available in the final global state of the algorithm, it may not contain any information about the 
inputs of either party. If the parties are allowed prior entanglement, then using quantum teleportation, one 
can implement any protocol such that the messages are classical and completely random. So, the transcript 
will just be a random string of length c independent of the actual inputs! 

The definition of information loss for quantum protocols: We address these difficulties by considering 
the information carried by each message separately. As observed above messages may themselves carry no 
information, so we examine the information carried in the message by including the context in which it is 
received. For example, consider a protocol for the AND problem. Fix some distribution for the inputs of 
Alice and Bob. We account for the information carried in a message sent by Alice to Bob, by considering the 
mutual information between Alice's input and the entire state of Bob, including the message just received. 
The information loss (we use the term loss instead of cost) of the protocol (for the given distribution) is 
defined to be the sum of these quantities (both for Alice and Bob) taken over all rounds. With this definition, 
the arguments of [BJKS02| are easily carried over to the quantum setting. We can then conclude that if the 
information loss of computing the AND of two bits is e then the communication complexity of the set 
disjointness function is Q(ne). 

We have arrived at the second part of the programme, that is, to show non-trivial lower bounds on 
the information loss of computing the AND of two bits. In the original argument of [BJKS02| this was 
achieved by a direct argument using certain distance measures between probability distributions. Since, we 
are working with a different notion of information loss, this argument does not appear to be immediately 
applicable in our case; so, instead of reviewing it, we will now directly describe our argument. We are given 
a quantum communication protocol for computing the AND function. We consider two kinds of inputs: 
first, Alice has and Bob has a random bit; second, Bob has a and Alice has a random bit. Suppose 
we are given that for such distributions at no stage does a receiver of a message gain more than e bits of 
information about the input of the sender. We wish to show that if e is very small, then this leads to a 
contradiction. Our argument can be understood at an intuitive level in the framework of round-elimination. 
Suppose, Alice sends the first message. We know that her message does not deliver much information about 
her input to Bob, that is, the combined state of Bob at the end of the first round is essentially the same 
when the Alice's input is and when Alice's input is 1. So, Alice might as well send exactly the same 
message in the two cases, and incur a small error in the correctness. That is, no matter what her actual input 
is, Alice sends her first message assuming that her input is 0. Since we allow prior entanglement, we can 
eliminate this round of Alice, and obtain a protocol with one fewer round of communication. Now, it is 
Bob's turn. Our hypothesis says that his second message does not deliver much information about his input 
to Alice, when her input is 0. But the modified protocol so far has proceeded as if Alice's input is (even 
though her actual input might be something else). We can thus eliminate Bob's first message as well. If e 
is small, then the increase in error probability on account of this manoeuvre is also small. Proceeding in 
this manner we eliminate all rounds. But it is obvious that if the parties exchange no messages they cannot 
compute any non-trivial function unless one allows huge error probability. Since, there are at most k rounds 
of communication, this gives us a lower bound of the form e > e(k). Using these ideas one can show an 
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£l(n/k 2 ) lower bound on two-party quantum communication complexity of the set disjointness function. 
There are two aspects of our proof that require further comment. 

Local transition: Recall the argument used above to eliminate Alice's first message. We know that Bob's 
state is roughly the same even if Alice generates her message assuming that her input is 0. However, this 
does not immediately imply that the error probability of the protocol is not changed much. The final answer 
is not just a function of Bob's state but the combined state of Alice and Bob. In particular, even though the 
Bob's state is similar after the first round for the two inputs of Alice, his work qubits might be entangled with 
Alice's qubits differently in the two cases. This problem arises often in round elimination arguments and by 
now standard solutions exist for it by considering the fidelity IJoz94l between quantum states. This allows 
Alice to perform a local transition [KNTZ01| on her work qubits, in order to restore them to the correct 
state should she discover later that her actual input is different from what was assumed while generating her 
first message to Bob. 

A paradox?: In our notion of information loss of quantum protocols it is important that the parties start 
in a pure global state. In fact, this notion is unsuited for classical randomized communication complexity. 
Consider the following classical protocol for computing the AND of two bits (a,b). Alice sends Bob a 
random bit r, retaining a copy of r if and only if a = 1. Bob sends Alice r © b; if a = 1, Alice can recover 
b using the copy of r she has and determine a A b. Now, clearly, the first message does not deliver any 
information to Bob. Furthermore, when Alice has a 0, Bob's message delivers no information about his 
inputs, because Alice does not retain a copy of r in this case. So, according to our definition this protocol 
has zero information loss for both the distributions considered above. Yet, the protocol computes the AND 
correctly! Interestingly, no such quantum protocol starting with a pure global state is possible. 

1.3 The rest of the paper 

In the next section, we give some the definition and notation used in the rest of the paper. In Section PJJ we 
prove Result n Result 2 and Result 3 also follow using similar arguments, but their proofs are not included 
in this abstract. 

2 Preliminaries 

2.1 Quantum communication 

We define t-party quantum communication protocols which are a natural extension of the two-party quantum 
communication protocols as defined by Yao [Yao93]. Let / : X\ x X2 ■ ■ ■ X t — > Z be a function. There 
are t parties, Pi, P2, ■ ■ • , Pt, who hold qubits. When the communication protocol II starts, Pj holds \xi) 
where xi G Xi together with some ancilla qubits in the state |0). These parties may also share an input 
independent prior entanglement (say \ip)). Different parties possess different qubits of l^)- The parties 
take turns to communicate to compute f(xi,X2, • • • , xt). Suppose it is Pi's turn to communicate to P2. Pi 
can make an arbitrary unitary transformation on her qubits and then send one or more qubits to P2. The 
number of qubits send is predetermined and is independent of the input x\. Sending qubits does not change 
the overall superposition, but rather changes the ownership of the qubits, allowing P2 to apply her next 
unitary transformation on her original qubits plus the newly received qubits. At the end of the protocol, 
the last recipient of a message performs a von Neumann measurement in the computational basis of some 
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qubits in her possession (the 'answer qubits') to output an answer U(xi,X2, • • • , xt). We say that protocol II 
computes / with <5-error in the worst case (or simply with error 5), if max xl!X2) ... jXt Pr[il(xi, X2, ■ ■ ■ , xt) / 
f(xi,X2, • • • , xt)] < 5. The communication cost of II is the number of qubits exchanged in II between 
all the parties. The -round <5-error quantum communication complexity of /, denoted by Q k s {f), is the 
communication cost of the best /c -round <5-error quantum protocol with prior entanglement for /. When 5 is 
omitted, we mean that 8 = |. 

We require that the parties make a 'safe' copy of their inputs (using, for example, CNOT gates) before 
beginning protocol II. This is possible without loss of generality because the inputs are in computational 
basis states. Thus, the input qubits of the parties are never sent as messages, their state remains unchanged 
throughout the execution of II, and they are never measured i.e. some work qubits are measured to deter- 
mine the result U.(x\, X2, • • • , x t ). We call such protocols safe, and henceforth, we will assume that all our 
protocols are safe. 

Suppose A, B, C are three disjoint finite dimensional quantum systems having some joint density matrix 

p. Let pa be the reduced density matrix of A. Then S(A) = S(p) = — Tr p log p is the von Neumann entropy 

of A. The mutual information of A and B is defined as I {A : B) = S(A) + S(B) — S(AB). The conditional 

mutual information of A and B given C is defined as I ((A : B) | C) = S{AC)+S{BC)-S{C)-S{ABC). 
If C is a classical random variable taking the classical value |c) with probability p c , it is easy to see that 
I ((A : B) | C) = s ^ JC p c I{A c : B c ), where {AB) C denotes the joint density matrix of A and B when 
C = |c). We also write I {A : B \ C = c) for I(A C : B c ). 

Fact 1 (see |CvDNT98|) Let Alice have a classical random variable X. Suppose Alice and Bob share a 
pure state on some qubits (a prior entanglement) independent of X. Initially Bob's qubits have no informa- 
tion about X. Now let Alice and Bob run a quantum communication protocol, at the end of which Bob's 
qubits possess m bits of information about X. Then, Alice has to totally send at least m/2 qubits to Bob. 

Fact 2 (Sub-additivity of information,see | KNTZ01 1) Let Dbea classical random variable. Let Xi, . . . ,X 1 

be classical random variables which are independent given D. Let M be a quantum encoding of X = 
Xl.. X n . Then, I{(X : M) \ D) > £? = i 7(pQ : M) \ D). 

Definition 1 (Trace distance) Let p, a be density matrices in the same finite dimensional Hilbert space. 
The trace distance between p and a is defined as follows: \p — a\\ t = Tr \/ (p — o~)^(p — a). 

Definition 2 (Fidelity) Let p, a be density matrices in the same finite dimensional Hilbert space H. Their 
fidelity is defined as B(p,a) = sup^i^\ u\ |(^|</>)|, where K, ranges over all finite dimensional Hilbert 
spaces and \ ip),\<j)) range over all purifications of p, a respectively in 7i®lC. 

Fact 3 (see |AKN98|) Let p, a be density matrices in the same finite dimensional Hilbert space TL Let T 
be a measurement (POVM) on 7i. Then, \\J-p — Fo~\\\ < \\p — 0"|| t . 

The following lemmas are derived in the appendix. 

Lemma 1 Let pi, p2 be two density matrices in the same finite dimensional Hilbert space TC, /C any Hilbert 
space of dimension at least the dimension ofTC, and \ <pi) any purifications of pi in 7i <g> /C. Then, there is a 

local unitary transformation U on JC that maps \<p2) to \<p' 2 ) = (J (8) U)\<p2) (I I s the identity operator on TC) 
such that 

||I0i)(0i| - !02><02l|U < 2-s/l - B( Pl ,p 2 ) 2 < 2^2(1 - B( Pl ,p 2 )). 
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Lemma 2 Suppose X and Q are disjoint quantum systems, where X is a classical random variable uni- 
formly distributed over {0, 1} and Q is a quantum encoding x — > a x ofX. Then, 1 — B(a±,a2) < I(X : Q). 

2.2 Conditional information loss 

Let D, Xa and Xb be random variables taking values in some finite sets V, Xa and Xb respectively. We 
say that D partitions X = (Xa, Xb) if for all d G V, Xa and Xb are independent conditioned on the event 
D = d. Given random variables X and D, the random variable (X, D) n is obtained by taking n independent 
copies of (X, D). Thus, (X, D) n takes values in (Xa x Xb) h which we identify with X\ x X^. Suppose 
D partitions X, and (X, D) = (X, D) n , then it is easy to verify that D partitions X. 

Definition 3 (Embedding) For a G A n , j G [n], and u £ A, let embed (a, j, u) be the element of A n 
obtained by replacing a[j] by u, that is, em bed (a, j, u)[i] = a[i]for i / j, and em bed (a, j, u)[j] = u. 

Definition 4 (Collapsing input) Suppose F : X n — ► Z. We say that x G X n collapses F to the function 
h : X — > Z if for all u G X, F(embed(x, j, u)) = h(u). We say that a random variable X taking values in 
X n collapses F to h if it is collapses F to h with probability 1. 

Definition 5 (Conditional Information loss) Let II be a two-party k-round 5-error quantum protocol for 
computing F : Xa x Xb — > Z. Let Alice star? ?/ze protocol and let A' l B l be the joint state of Alice and 
Bob just after the ith message has been received. Let X = (Xa, Xb) be random variable taking values in 
Xa x Xb which is partitioned by the random variable D. Then, the conditional information loss of U under 
(X, D) is defined by 

IL(II | (X, D)) k ^ =li . Qdd I(X A : B* | D) + ^ti, i even I&B A* \ D). 

The k-round 5-error conditional information loss of F under (X, D), denoted by IL^ ^F | (X, D)), is the 
minimum \L(U \ (X, D)) taken over all k-round 5-error quantum protocols Ufor F. [Note that 5 bounds 
the error for all inputs. In particular, this error bound applies even to inputs not in the support of X.] 

3 Lower bound for set disjointness 

Lemma 3 Let F : X\ x Xg — > Z. Let X be a random variable taking values in X = Xa x Xb; suppose 
X is partitioned by a random variable D taking values in some set T>. Let (X, D) = (X, D) n . Suppose X 
collapses F to the function h : X A x X B -> Z. Then, \lk,s(h \ (X, D)) < ^-Q k s (F). 

Proof: Suppose II is a £>round (5-error quantum protocol for F with total communication c. Let us assume 
that Alice starts the communication. Our goal is to show that there is a /c-round (5-error protocol for h 
with small information loss under (X, D). While analysing IT, we will need to maintain that the combined 
state of Alice and Bob is pure at all times. However, we will run II on random inputs drawn from certain 
product distributions. In such a situation, we will adopt the following convention. We will assume that 
in addition to the usual input registers IN^, Alice has another set of registers IN^. When we require that 
Alice's inputs be some random variable Xa, we in fact, start with the following state in the registers IN^IN^: 

J2 x ex A VPx\ x )\ x )i where p x = Pr[X^ = x]. Similarly, we simulate Bob's random input Xb in registers 
I N b and I N#. Then, we run the protocol II as before with input registers I N a and I N^- During this execution 
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no quantum gates are applied to registers IN^ and IN#. From now on (similarly X#) denotes the state 
of the registers IN^, which stays constant because the protocol II is safe. In this revised protocol IT', let 
A l B l denote the state of the entire system immediately after the ith message has been received (note that 
A % includes the register \Ha and B 1 includes IN^). Consider the execution of II' on input X = (X-a, X#) 
conditioned on D = d; note that under this condition X A and X^ are independent and the convention 
described above for simulating random inputs applies. Then, we have 

Vi, 1 < i < k, i odd, £"=i I((X A [j] : &) | D = d) < J((X A : B l ) | D = d) < 2c. 

The first inequality above follows from Fact |2] because by our definition of (X, D), (X^fj] : 1 < j < n) 
are independent random variables when conditioned on D = d; the second inequality follows from Fact^ 
Averaging over the possibilities for D, we obtain: < i < k,i odd, Ej=i I(0^a[j\ '■ B l ) \ 

D) < 2c. Similarly, we obtain Vi, 1 < i < k, i even, Y!j=i ^(( x s[i] : Ai ) I D ) < 2c - Summing these 
inequalities over all rounds i, we obtain 

E -=i (E^, , odd IPaW ■■ # I D) + Eti, i even I<*bV\ : A* | D)) < 2cfc, 
which implies: 

3j, 1 < J < n, Ef =1 , i odd H&M ■ Bi ) I D ) + Eli, i even I(fr B \j] : A<) | D) < ^. (1) 
Fix a value of j so that the last inequality holds. For d € V n , let 

Hd) = EL, , odd J ((X^[i] : B*) | D = d) + EjLi, i even '((Xiibl : A«) | D = d). (2) 

Then, from (Q, and the definition of conditional mutual information Ed [/(D)] < — . 

We will now obtain a protocol for h by 'embedding' its input as the jth input of II'. Using a straightfor- 
ward averaging argument we first fix a value d G V n so that 

2^Pr[£> = d]/(embed(d, j,d)) = E[/(embed(d, j, D)) < . (3) 

dev D n 

Consider the following protocol IT/j for computing H(ua, ug)- On input ua € Xa, Alice prepares her input 
registers as follows. In the registers (IN^[£], ll\U[^] : I ^ j) Alice places the superposition ExeA'™- 1 VPx\ x )\ x }-> 
where p x = Pt[(X.a[£] ■ £ ^ j) = x | D = d]; register ll\U[j] * s set to \ u a}- On input ub € Xb, Bob 
prepares his input registers in a similar fashion. Then, Alice and Bob apply the protocol IT, treating IN A 
and \ Hb as input registers. Note that IN A and IN^ do not exist in IT^. 

We need to verify that this protocol for computing h has two properties. First, it computes h correctly 
with high probability. For this, we note that in this protocol, at all times, the state of the registers that were 
present in the original protocol II (that is all registers except IN A and IN^) is identical to their state when 
the original protocol II is run with input embed(X, j, (ua,ub)) conditioned on the event D = d. Since X 
collapses F to h, we conclude that 11^ computes h(uA,us) with probability at least 1 — 8. 

Second, we need verify that IL(n^ | (X, D)) is small. We expand the LHS of (J3} using the definition 
© of /(d) and show that each term in it is at least the corresponding term in I L ( TT/j | (X, D)). For example, 
consider the term I(X A : B i \ (D = d)) in the definition of \L(U h \ (X, D)). Note that the state (X A , B l ) 
of Hh on input X conditioned on D = d, is identical to the state obtained from (KaU], B l ) of II' by 
omitting the register IN^[i], when II' is run on input X conditioned on D = embed(d, j, d). It follows 
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from the monotonicity property of information that I{Xa '■ B % \ (D = d)) is at most J(X/i[j] : B l | (D = 
embed(d, j, d))). We can then conclude (details omitted) that IL^, (X,D)) < ■ 
As in [BJKS02|, let D be a random variable taking values in {A, B} uniformly. Let Xa, Xb = {0, 1} 
and X = (Xa, Xb) be a random variable taking values in Xa x Xb = {0, l} 2 , whose correlation with D is 
described Pi[X = 00 | D = A],Pt[X = 10 | D = A],Pt[X = 00 | D = B],Pv[X = 01 | D = B] = ± 
It is clear that conditioned on D = A and D = B, Xa and Xb are independent. Note that X n collapses 
D IS J to AND . We now show a lower bound for the conditional information loss of AND under (X, D). 



Lemma 4 Let (X, D) be as above. Let e > 0. Then IL fci 5(AND | (X, D)) > 



(l-2c) 2 
4fc 



Proof: Let II be a fc-round e-error quantum protocol for AND with rj = IL(I1, (X, D)) = IL^ ^(AND | 
(X, D)). Consider the situation in II just after the ith message has been sent. Let m 1 denote the qubits 
of the ith message. Let Xa,Xb denote the random variables corresponding to Alice's and Bob's inputs 
respectively in II. Suppose (Xa,Xb) = (x,y). Let \(fi xy } be the global state vector of Alice's and Bob's 
qubits, and let A l ,B l denote Alice's qubits and Bob's qubits respectively at this point in time. Suppose 



m 1 is sent from Alice to Bob. Then B % 
disjoint sets of qubits. Let a l xy , (3 xy 

c B = I(X : B i | D) and c\ = I(Y : 
Alice applies to A t ~ 1 after receiving the (i 

iy) = U^W' 1 
1 „i-l 



B l ~ L U m 1 and A 1 



A 1 U m l 



denote the reduced density matrices of A\ B % in the state \<p xy ) 



Then, \4 

B(a { 



00 ' a 01 

we write 



< I((Y : A' 



and the unions are over 
. Define 

A % | D). Let U Al 1 denote the unitary transformation that 
- l)st message from Bob, in order to prepare the ith message. 
By LemmaEl 1 - B(f3 l 00 ,p[ ) < I{(X : B { ) \ Y = 0) < 2c B and 1 - 
x ) | X = 0) < 2c^~ 1 . To keep our notation concise, for state vectors \<j>) and 



instead of 



t . By Lemma[0 there exist unitary transformations 



V 0Q- 



•01 



acting on B 



i-l 



and V A [ 



, 10 acting on A 1 such that 



Vr 



Define 8i-\ 



oo^oi I ^oo 1 ) 

A 



^^X 1 ) 1 ' 2 and 



^00^10 1 



/oo 



i a/2 



(4) 



Vr 



00^011*^10 



Using the unitary invariance and triangle inequality of the 



trace norm, the fact that unitary transformations on disjoint sets of qubits commute, and ©, 



< 



Vr 



00^10 1 V01/ 



J lll 



00^01 ) 1 ^00-»l li !' 



I4i> - (^oi)- Vh>) 



Vr 



00^10 (^00^01 ) 1 |^0l) (K)0— I ''11/ 



\<t>\ 



V 



rB i 



00^10 -^01 1 



- 1 ^ 1 |^oI 1 ) 



(Vr 



Vr 



A 1 
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u- 



Vr 
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u- 



(Vr 



B 1 - 1 \ 
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00^01 
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00—01 / 



K 00-*10 u 



A 



V A1 U A 

^OO^lO 17 
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V 
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u Al 
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+ 



10 / 



+ 



^oo^iol'Aoo) 



^oo 1 1 



y 10 



^io) 



00^01 1 ^00 



+ 



Vr 



00^101^00/ 



'vai 



< 4(4 1 ) 1 /2 + 4 (c^) 1 /2 + (5 ._ 1 . 



y ll 



+ 

- U A 
+ 



00^01 , 



(Vr 



+ 



Vr 



B' 



00^01 1^10 



00^01 / 



it 1 : 
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It is easy to check that Sq = 0. Hence using concavity of the fourth root function, 5 k < 4fc (?) . Now 
a correct k -round e-error protocol for AND must have (from Fact |3] and using the fact that a local unitary 
transformation does not affect the density matrix of the remote system), 6k > \\Pio ~ Pn\\ t > 2 — 4e. 

Hence, rj > . ■ 

The following is now immediate from Lemma|3]and LemmalU 

Theorem 1 Any two-party k-round bounded error quantum protocol for the set disjointness problem needs 
to have communication cost at least Q (ipr). 

Corollary 1 Any two-party bounded error quantum protocol for the set disjointness problem needs to have 
communication cost at least f2 (n 1 / 3 ). 
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A Quantum information theory background 



In this section we give some basic quantum information-theoretic definitions and facts which will be useful 
in stating and proving our main results. For an excellent introduction to quantum information theory, see the 
book by Nielsen and Chuang I NCOOII . 

Suppose P,Q are probability distributions on the same finite sample space [k]. Their total variation 

distance is defined as follows: \\P — Q\\i = X^ie[fc] l-^W ~~ Q{i>)\- The quantum generalisation of the total 
variation distance of a pair of probability distributions is the trace distance of a pair of density matrices. 
Recall that a density matrix over a finite dimensional Hilbert space TL is a unit trace, Hermitian, positive 
semidefinite linear operator on TL. 

Definition 6 (Relative Entropy) If p,cr are density matrices in the same Hilbert space, their relative en- 
tropy is defined as S(p\\a) = Tr (p(log p — logo - )). 

Let p be a density matrix in a finite dimensional Hilbert space TL. Suppose T is a measurement (POVM) 
on TL. Then Tp denotes the probability distribution on the (finite number of) possible outcomes of T got by 
performing the measurement T on the state p. The following fundamental facts (see [AKN98]) show that 
both the trace distance and relative entropy only decrease on performing a measurement. 

Fact 4 Let p, a he density matrices in the same finite dimensional Hilbert space TL. Let T be a measurement 
(POVM)onTL. Then, \\Tp-Ta\\i < \\p-<r\\ t . 

Fact 5 Let p, a be density matrices in the same finite dimensional Hilbert space TL. Let T be a measurement 
(POVM)onTL. Then, S(Fp\\F(r) < S(p\\a). 

Jozsa [ Joz94 1 gave an elementary proof for finite dimensional Hilbert spaces of the following basic and 
remarkable property about fidelity. 

Fact 6 Let p, a be density matrices in the same finite dimensional Hilbert space TL. Then for any finite 
dimensional Hilbert space fC such that dim(/C) > dim(TL), there exist purifications \ip),\<f)) of p,a in 
TL&KL, such that 

B{p,a) = M4>)\. 

Also, 

B {p,o) = HvW^llt • 
We will also need the following result about fidelity, proved by Fuchs and Caves [FC95|. 

Fact 7 Let p, a be density matrices in the same finite dimensional Hilbert space TL. Then 

k 

B(p,a) = inf Y^^WV^W), 

1=1 

where {F\, ... , Fk} ranges over POVMs on TL. In fact, the infimum above can be attained by a complete 
orthogonal measurement on TL. 

The following relation is known between fidelity and trace distance between two density matrices [NC00|. 
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Fact 8 Let p, a be density matrices in the same finite dimensional Hilbert space TL. Then 

2(1- B(p, a)) < \\p -a\\ t < 2y/l - B(p,a)*. 
The following information-theoretic facts follows easily from the definitions. 

Fact 9 Let X be a classical random variable and M be a quantum encoding of X. Let X take the values 
1, . . . , I with probabilities pi,...,pi and let o~\, . . . ,o~i be the respective density matrices of M. Let a = 
Y2j=iPj a j b e me average density matrix of M. Then, I(X : M) = Y2j=iPjS(o'j\\o~). 

B Improved Average encoding and Local transition theorem 

In this section, we observe that the following lemma from [DCHR78| can be used to improve the average 
encoding and local transition arguments of [KNTZ01 ]. If Lemmas |5] and are used in their place, the factor 
k 4 in the denominator of some existing lower bounds (e.g. [KNTZ01 ] and [ JRS02]) can be replaced by k 2 . 

Lemma 5 Let p and a be two density matrices such that S(p\\a) is finite. Then, 

B{p,a) > 2- s ("H/ 2 . 

Proof: Let M be the complete orthogonal measurement which achieves the infimum as in the Fact0 Let 
P and Q be the classical distributions resulting after the measurement M is performed. From Fact [5] and 
concavity of the log function it follows that: 

-(l/2)S(p\\a) < -(1/2)S(P\\Q) = J>log7^M 

i 

< log ^2 VQiPi 
i 

= log B(P,Q) = log B(p, a). 

■ 

Corollary 2 Let p and a be two density matrices such that S(p\\a) is finite. Then, 

l-B(p,a)<((]n2)/2)S(p\\a). 

Proof: If ((In 2)/2)S(p\\ a) > 1 then the inequality is trivial since B( , ) > 0. Therefore when ((In 2)/2)S(p\\a) < 
1, 

B(p,a) > 2~ s ^/ 2 

> exp -((ln2)/2) 1 S(p||<7) 

> 1 - {(\n2)/2)S{p\\a) (since exp' x >l-x, for < x < 1) 
^1-B(p,a) < (Qn2)/2)S(p\\a). 

■ 

The following lemma follows immediately from the above corollary and Fact|9l 
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Lemma 6 (Average encoding theorem) Suppose X, Q are two disjoint quantum systems, where X is a 
classical random variable which takes value x with probability p x , and Q is a quantum encoding x ^ a x 
of X. Let the density matrix of the average encoding be a = Y2xP xCTx ' Then, 

Y,p x (l - B(p,p x )) < (ln2/2)/(X : Q). 

X 

The following lemma follows immediately from Fact|6]and Fact[8]and Corollary |2] 

Lemma 7 (Local transition theorem) Let p\ , p2 be two density matrices in the same finite dimensional 
Hilbert space H, K, any Hilbert space of dimension at least the dimension ofTC, and \(pi) any purifications 

of pi in TL (g) /C. Then, there is a local unitary transformation U onK, that maps \4>2) to |(/>2) = (1 U)\4>2) 
( I is the identity operator onTi) such that 

lll&Wll - l02><02l|U < 2y/l-B( Pl ,p2)* < 2^2(1 -B( Pl ,p 2 )) < 2y/]n2(S( Pl \\p2)). 

Fact 10 (|Lin91|) Suppose X and Q are two classical correlated random variables, where X is uniformly 
distributed over {0, 1} and Q is an encoding x — > P x of X. Then, 

1-B(P 1 ,P 2 ) <I(X:Q). 

Following corollary is immediate from FactQand monotonicity of information, 

Corollary 3 Suppose X and Q are disjoint quantum systems, where X is a classical random variable 
uniformly distributed over {0, 1} and Q is a quantum encoding x — > a x of X. Then, 1 — B(ai, a 2 ) < I(X : 
Q). 
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